Adaptive Front-ends for End-to-end Source Separation
نویسندگان
چکیده
Source separation and other audio applications have traditionally relied on the use of short-time Fourier transforms as a front-end frequency domain representation step. We present an auto-encoder neural network that can act as an equivalent to short-time front-end transforms. We demonstrate the ability of the network to learn optimal, real-valued basis functions directly from the raw waveform of a signal and further show how it can be used as an adaptive front-end for end-to-end supervised source separation.
منابع مشابه
End-to-end Source Separation with Adaptive Front-Ends
Source separation and other audio applications have traditionally relied on the use of short-time Fourier transforms as a front-end frequency domain representation step. The unavailability of a neural network equivalent to forward and inverse transforms hinders the implementation of end-to-end learning systems for these applications. We present an auto-encoder neural network that can act as an ...
متن کاملMatching the Acoustic Model to Front-End Signal Processing for ASR in Noisy and Reverberant Environments
Distant-talking automatic speech recognition (ASR) represents an extremely challenging task. The major reason is that unwanted additive interference and reverberation are picked up by the microphones besides the desired signal. A hands-free human-machine interface should therefore comprise a powerful acoustic preprocessing unit in line with a robust ASR back-end. However, since perfect speech e...
متن کاملBlind IQ-Imbalance Compensation Using Iterative Inversion for Arbitrary Direct Conversion Receivers
Besides low-IF techniques, the direct down conversion or homodyne reception gains more and more attention in academical and industrial research. In this article we describe both, multiplicative and additive mixing within analog direct down conversion front-ends. The effect of gain and phase imbalances of the local oscillator as well as signal path mismatches within the analog front-end lead to ...
متن کاملRobust automatic speech recognition using a multi-channel signal separation front-end
A multi-channel signal separation front-end for robust automatic speech recognition under time-varying interference conditions is developed. The speech signals acquired by a dual-channel system are restored by adaptive decorrelation filtering, and then examined by a time-domain or frequency-domain source signal detection technique to determine the active regions of each source signal. The front...
متن کاملPerformance of an adaptive homodyne receiver in the presence of multipath, rayleigh-fading and time-varying quadra - Circuits and Systems, 2003. ISCAS '03. Proceedings of the 2003 International Symposium on
In this paper, we carry out a detailed performance analysis of the blind source separation based I/Q corrector operating at the baseband. Performance of the digital I/Q corrector is evaluated not only under time-varying phase and gain errors but also in the presence of multipath and Rayleigh fading channels. Performance under low-SNR and different modulation formats and constellation sizes is a...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017